DNA Sequences Encoding the Human A1, A2a and A2b
Adenosine Receptors

Field of the Invention

The present invention relates to DNA sequences
encoding the human A1, A2a and A2b adenosine receptors.
In addition, the present invention relates to the use of
these DNA sequences in the production of the human A1, A2a
and A2b adenosine receptors using recombinant DNA
technology.

Background of the Invention

Adenosine influences cardiovascular function (by
slowing heart rate and decreasing blood pressure) and also
influences nervous system function (through sedative and
anti-epileptic effects). In addition, adenosine can
induce bronchoconstriction. Adenosine binds specifically
to at least three receptors: A1, A2a and A2b.
Adenosine receptors have been shown to couple to a number
of second messenger systems. Additional adenosine
receptor subtypes may exist. As adenosine receptor
agonists and antagonists may have commercial value as
anti-hypertensive agents, hypnotics, anti-psychotics and
bronchodilators, the ability to produce adenosine
receptors by recombinant DNA technology is advantageous.
The present inventors have isolated three related
cDNA fragments encoding the human A1, A2a and A2b
adenosine receptors from human hippocampal cDNA, either by
using the polymerase chain reaction and unique degenerate
oligonucleotides to generate specific probes or by using
specific consensus oligonucleotide probes for cDNA library
screening. Full-length cDNA clones for each of the three
receptors were isolated from a human hippocampal cDNA
library. The receptor sequences were identified as the
human A1, A2a and A2b adenosine receptors by expression in
mammalian cells and measurement of both the affinity of
the encoded receptors for various adenosine analogues and
the effect of receptor activation on cAMP synthesis. The
receptors have homology to cDNAs encoding the dog A1 and
A2a adenosine receptors (MAENHAUT, C., VAN SANDE, J.,
LIBERT, F., ABRAMOWICZ, M., PARMENTIER, M.,
VANDERHAEGEN, J., DUMONT, D., VASSART, G. AND
SCHIFFMANN, S. (1990); LIBERT, F., SCHIFFMANN, S.M.,
LEFORT, A., PARMENTIER, M., GERARD, C., DUMONT, J.E.,
VANDERHAEGHEN, J.J., VASSART, G. (1991)) and the rat A2b
adenosine receptor (STEHLE, J.H., RIVKEES, S.A.,
LEE, J.J., WEAVER, D.R., DEEDS, J.D. AND REPPERT, S.M.
(1992)). These hippocampal cDNA sequences represent novel
human receptors which may be of clinical and commercial
importance.

Summary of the Invention 

Accordingly, in a first aspect the present invention
consists in a DNA molecule encoding the human A1 adenosine
receptor, the DNA molecule having a sequence substantially
as shown in Figure 1 or a functionally equivalent sequence.

In a second aspect the present invention consists in
a DNA molecule encoding the human A2a receptor subtype,
the DNA molecule having a sequence substantially as shown
in Figure 2 or a functionally equivalent sequence.

In a third aspect the present invention consists in a
DNA molecule encoding the human A2b adenosine receptor
subtype, the DNA molecule having a sequence substantially
as shown in Figure 3 or a functionally equivalent sequence.

As used herein the term "functionally equivalent
sequence" is intended to cover variations in the DNA
sequence which, due to degeneracy of the DNA code, do not
result in the sequence encoding a different polypeptide.
Further, this term is intended to cover alterations in the
DNA code which lead to changes in the encoded polypeptide,
but in which such changes do not affect the biological
activity of the polypeptide.
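
The first sense of "functionally equivalent sequence" (variation that is silent because the code is degenerate) can be illustrated with a short sketch. This is only an illustration under stated assumptions: the function names are hypothetical, the standard genetic code is assumed, and the 12-nucleotide test sequence is a reading of the first four codons of the Figure 1 open reading frame (Met Pro Pro Ser), not a quotation of the figure.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence codon by codon (one-letter code)."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


def same_polypeptide(dna_a, dna_b):
    """True when two coding sequences encode the identical polypeptide."""
    return translate(dna_a) == translate(dna_b)


# Hypothetical example sequences (assumptions, not taken verbatim from Figure 1).
reference = "ATGCCGCCCTCC"       # Met Pro Pro Ser
silent_variant = "ATGCCACCTAGC"  # different codons, same amino acids
missense_variant = "ATGCTGCCCTCC"  # second codon now encodes Leu

print(translate(reference))
print(same_polypeptide(reference, silent_variant))
print(same_polypeptide(reference, missense_variant))
```

The second sense of the term (changes in the polypeptide that leave biological activity unaffected) cannot be decided from sequence alone and is outside what this sketch checks.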

As used herein the term "DNA molecule" is intended to
cover both genomic DNA and cDNA.

In a fourth aspect the present invention consists in
a method of producing the human A1 adenosine receptor
comprising culturing a cell transformed with the DNA
molecule of the first aspect of the present invention
under conditions which allow expression of the DNA
sequence such that the human A1 adenosine receptor is
expressed on the cell surface and optionally recovering
the human A1 adenosine receptor.

In a fifth aspect the present invention consists of a
method of producing a human A2a adenosine receptor
comprising culturing a cell transformed with the DNA
molecule of the second aspect of the present invention
under conditions which allow expression of the DNA
sequence such that the human A2a adenosine receptor is
expressed on the cell surface and optionally recovering
the human A2a adenosine receptor.

In a sixth aspect the present invention consists of a
method of producing a human A2b adenosine receptor
comprising culturing a cell transformed with the DNA
molecule of the third aspect of the present invention
under conditions which allow expression of the DNA
sequence such that the human A2b adenosine receptor is
expressed on the cell surface and optionally recovering
the human A2b adenosine receptor.

In further aspects the present invention consists of
a method of screening a molecule for adenosine agonist or
antagonist activity, comprising contacting the molecule
with the human A1, A2a or A2b adenosine receptors produced
by the method of the fourth, fifth or sixth aspect of the
present invention.

In yet a further aspect the present invention
consists in oligonucleotides 305, 377 and 376 as
hereinafter described.

The DNA molecules of the present invention represent
novel human receptors. These receptors may be of interest
both clinically and commercially as they are expressed in
many regions of the body and as adenosine affects a wide
number of systems.

The isolated full-length DNA clones containing the
complete coding region for these receptors can be used to
establish mammalian cell lines producing the receptors for
use in agonist and antagonist screening. The receptor DNA
sequence can be used for additional homology screening to
identify novel members of this receptor family.

In order that the nature of the present invention may
be more clearly understood preferred forms thereof will
now be described with reference to the following examples
and figures in which:-

Figure 1 shows the nucleotide and amino acid sequence
of the human A1 adenosine receptor cDNA.

Figure 2 shows the nucleotide and amino acid sequence
of the human A2a adenosine receptor cDNA.

Figure 3 shows the nucleotide and amino acid sequence
of the human A2b adenosine receptor cDNA.

Figure 4A shows saturation isotherms of the total
(unfilled triangle), specific (filled circle) and
non-specific (unfilled square) binding of the A1 adenosine
receptor antagonist DPCPX (8-cyclopentyl-1,3-
dipropylxanthine) to mammalian CHO K1 cells expressing the
human A1 adenosine receptor.

Figure 4B shows competition binding curves showing
the displacement of CGS-21680
(2-p-(2-carboxyethyl)phenethylamino-5'-N-ethylcarboxamido
adenosine hydrochloride) by different adenosine agonists
and antagonists (NECA = 5'-N-ethylcarboxamido adenosine;
CA = 2-chloroadenosine; CPA = N6-cyclopentyladenosine;
XAC = xanthine amine congener; T = 8-(p-sulphophenyl)-
theophylline) in mammalian HEK 293 cells expressing the
human A2a adenosine receptor.

Figure 5 shows the effects of the different adenosine
receptor subtypes, A1, A2a and A2b, upon cyclic AMP
production. A1 adenosine receptor activation leads to
inhibition of forskolin stimulated cAMP levels.
Activation of both the A2a and A2b adenosine receptors (by
CGS-21680 and NECA, respectively) leads to stimulation of
cAMP levels.
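
The three curves named for Figure 4A are related by simple arithmetic: specific binding is total binding minus non-specific binding. The sketch below shows that subtraction, together with a one-site saturation model that is commonly fitted to such isotherms. The model choice, the function names and all numbers are assumptions made for illustration; the text does not state how the binding data were analysed and gives no Kd or Bmax values.

```python
def specific_binding(total, nonspecific):
    """Specific binding at each ligand concentration: total minus non-specific."""
    if len(total) != len(nonspecific):
        raise ValueError("total and nonspecific must have the same length")
    return [t - n for t, n in zip(total, nonspecific)]


def one_site_binding(ligand_conc, bmax, kd):
    """One-site saturation model (an assumption): B = Bmax * [L] / (Kd + [L])."""
    return bmax * ligand_conc / (kd + ligand_conc)


# Hypothetical measurements in arbitrary units, for illustration only.
total = [10.0, 18.0, 26.0]
nonspecific = [2.0, 4.0, 8.0]
print(specific_binding(total, nonspecific))

# In this model, half of Bmax is bound when the ligand concentration equals Kd.
print(one_site_binding(ligand_conc=2.0, bmax=100.0, kd=2.0))
```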
METHODS

Oligonucleotide Design and Synthesis

Unique degenerate oligonucleotides corresponding to
the transmembrane II (TM II) and IV (TM IV) regions of G
protein-coupled receptors and containing either a 5' EcoRI
restriction enzyme site (TM II oligonucleotide 377) or a
3' Hind III restriction enzyme site (TM IV
oligonucleotides 305 and 376) were synthesized on an
Applied Biosystems automated DNA synthesiser. The
sequences of the oligonucleotides are as follows:-

305 5'-CCCAATAAGCTTAGICCIATGGCGAAAGACAGGACCCA-3'
       A A G G C
       A A

376 5'-GAGTCCGAAGCTTAGTGGGCAAGAGATGGCGAAIGAIAGIACCA-3'
       G TA C A G
       T A

377 5'-CAGAACGAATTCAATGTTTTTATGTGGTCTTTGTCITCIACTGA-3'
       CGG G G G C
       A

The DNA sequences included inosine (I) residues. 
Crude oligonucleotides were then used in the polymerase 
chain reaction. 
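
As a quick check on the primer design described above, the following sketch locates the EcoRI (GAATTC) and Hind III (AAGCTT) recognition sequences in the principal sequence of each oligonucleotide and counts the inosine (I) positions. The alternative bases listed beneath each oligonucleotide are ignored, and the dictionary and function names are hypothetical.

```python
# Principal sequences of the three degenerate oligonucleotides, 5' to 3'.
PRIMERS = {
    "305": "CCCAATAAGCTTAGICCIATGGCGAAAGACAGGACCCA",
    "376": "GAGTCCGAAGCTTAGTGGGCAAGAGATGGCGAAIGAIAGIACCA",
    "377": "CAGAACGAATTCAATGTTTTTATGTGGTCTTTGTCITCIACTGA",
}

# Recognition sequences of the two restriction enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "HindIII": "AAGCTT"}


def find_sites(sequence):
    """Return {enzyme: 0-based start index} for each site present in sequence."""
    found = {}
    for enzyme, site in SITES.items():
        index = sequence.find(site)
        if index != -1:
            found[enzyme] = index
    return found


def inosine_count(sequence):
    """Number of inosine (I) residues in the oligonucleotide."""
    return sequence.count("I")


for name, seq in PRIMERS.items():
    print(name, find_sites(seq), inosine_count(seq))
```

In each oligonucleotide the restriction site sits near the 5' end of the oligonucleotide itself, which is where a cloning site has to be placed so that it does not interfere with priming from the 3' end.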
PCR Amplification

Sequences homologous to the G protein-coupled
receptor oligonucleotides were amplified from human cDNA
using PCR and the Hybaid thermocycler. DNA was prepared
from a human neuroblastoma (Clontech) cDNA library in
lambda gt10 and from a hippocampal (Stratagene) cDNA
library in lambda ZapII. DNA was prepared by phenol and

p 

chloroform extraction of approximately 10 library phage 
and ethanol precipitation to recover the DNA. DNA from
the cDNA libraries (1-5 µg) was incubated with 200 µM of
each dNTP, 0.5 µM oligonucleotide, 0.5 units Tth enzyme
(Toyobo) in 50 mM KCl, 50 mM Tris-HCl pH 9.0, 1.5 mM MgCl2
(1 x PCR buffer) in a 50 µL reaction volume. Samples were
layered with 50 µL light mineral oil (Sigma).
Reactions were denatured for 5 minutes at 95°C. The PCR
conditions were as follows: Denaturation for 2 minutes at
92°C, annealing for 2 minutes at 55°C, and extension
for 2 minutes at 92°C, 2 minutes at 50°C, and
2 minutes at 70°C, repeated five times; then 2 minutes
at 95°C, 2 minutes at 45°C, and 2 minutes at 70°C,
repeated thirty times.
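
The final concentrations quoted for the reaction can be converted into absolute amounts per tube. The sketch below does this for a 50 µL reaction, reading the concentrations as 200 µM of each dNTP, 0.5 µM oligonucleotide and 1.5 mM MgCl2; the function name is hypothetical, and the calculation is plain unit arithmetic rather than anything stated in the text.

```python
def amount_nmol(conc_micromolar, volume_microlitres):
    """Amount of solute in nmol: (µmol/L) x (µL) gives pmol, divided by 1000."""
    return conc_micromolar * volume_microlitres / 1000.0


REACTION_VOLUME_UL = 50.0

# Final concentrations in µM, as read from the reaction description.
components_uM = {
    "each dNTP": 200.0,
    "oligonucleotide": 0.5,
    "MgCl2": 1500.0,  # 1.5 mM
}

for name, conc in components_uM.items():
    print(f"{name}: {amount_nmol(conc, REACTION_VOLUME_UL):g} nmol")
```

Under these assumptions each reaction contains about 10 nmol of each dNTP, 25 pmol of oligonucleotide and 75 nmol of MgCl2.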

Subcloning and Sequencing of Amplified DNA Fragments

Amplified DNA (20 µl) was removed and analysed by gel
electrophoresis in 1% agarose and 3% NuSieve (SeaKem).
Amplification products 260bp-330bp in length were excised
from the gel and purified with Geneclean.
DNA fragments were then digested with Hind III for one
hour at 37°C and EcoRI for one hour at 37°C, the DNA
again purified with Geneclean and eluted into 10 µl
H2O. Digested DNA fragments were then subcloned into
M13mp19 and sequenced by the Sanger dideoxy
chain-termination method using the Pharmacia or the
Promega DNA sequencing kit. Sequencing reactions were
analysed on a 6% acrylamide, 7M urea gel, dried onto
Whatman 3M paper, and exposed to X-ray film (Kodak
X-OMAT AR5) for sixteen hours (overnight) at room
temperature.

Sequence Analysis of Novel DNA Sequences

Sequence analysis of the DNA fragments generated from
the PCR amplification identified two DNA fragments that
had sequences common to other known G protein-coupled
receptors. PCR amplification of neuroblastoma cDNA with
the degenerate oligonucleotides 377 and 305 produced a
cDNA fragment which was designated 3.1. PCR amplification
of human hippocampal cDNA with the degenerate
oligonucleotides 377 and 376 produced a cDNA fragment with
a sequence that was 76% homologous at the nucleotide level
to sequence 3.1 and was designated 3.2. The DNA sequences
were searched on the GenBank and EMBL databases for
comparison to known sequences and were confirmed to be
novel sequences with a high level of homology to dog
adenosine A1 and A2 receptors.
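
A figure such as "76% homologous at the nucleotide level" is usually a percent identity over an alignment. The sketch below computes percent identity for two sequences that are already aligned and of equal length. The function name and the toy sequences are hypothetical; the text does not say which alignment method or program produced the 76% value.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of positions with identical bases in two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# Toy sequences for illustration only (not fragments 3.1 and 3.2).
print(percent_identity("ATGCATGC", "ATGCATGA"))
```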

Isolation of Full-Length cDNA Clones

Full-length cDNA clones encoding the A1 receptor as
well as receptor sequences corresponding to 3.1 and 3.2
were isolated from a human hippocampal cDNA library
(Stratagene).

A1 adenosine receptor cDNA isolation

Specific consensus oligonucleotides corresponding to
the second extracellular loop (679), and to the third
intracellular loop (678) were synthesised on an Applied
Biosystems automated DNA synthesiser. The sequences of
the oligonucleotides are as follows:-

678 5'-CCC^TAGTACTTCTGCGGGTCGCCAGAGGAGGCGACACCTTCTTGCC-3'

679 5'-GAGGCGCAGCGGGCCTGGGCGGCCAACGGCAGCGGCGGCGAGCCCGTG-3'

Approximately 5 x 10^5 plaques were plated on C600
HflA bacterial cells. Plaques were lifted on to
Hybond-N+ nylon filters (0.45 µm, 137mm, Amersham). DNA
was denatured on the filters with a 3 minute incubation on
0.5M NaOH, 1.5M NaCl and neutralised with a 5 minute
incubation in 0.5M Tris pH 7.2, 1mM EDTA and 1.5M NaCl. DNA
was fixed to the filters with a 15 minute exposure to 0.4M
NaOH. Filters were then rinsed in 2 x SSC (20 x SSC is 3M
NaCl, 0.3M sodium citrate) and allowed to dry before a 30
minute prehybridisation in 40% formamide, 5 x SSC, 5 x
Denhardt's, 50mM NaPO4, 0.5% sodium dodecyl sulphate
(SDS), 0.1mg/ml salmon sperm DNA at room temperature.
Oligonucleotides 678 and 679 were pooled and 50 pmoles
total were radiolabelled using [γ-32P]-ATP and the DNA 5'
end-labelling system (Promega). The filters were
hybridised with this radiolabelled probe overnight at
42°C, after which time they were washed once briefly in
2 x SSC at room temperature then twice for 10 minutes each
wash in 2 x SSC, 0.1% SDS at room temperature with a final
wash in 0.1 x SSC, 0.1% SDS for 15 minutes at 50°C. The
filters were then exposed to Kodak X-OMAT AR5 film
overnight at -70°C. Over twenty pure phage isolates
which hybridised to the radiolabelled 678 and 679
oligonucleotides were obtained. Several of these
different cDNAs were sequenced. The sequence of one such
cDNA (together with the deduced amino acid sequence) which
encodes the human A1 adenosine receptor is shown in
Figure 1.
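
The rinse and wash steps are specified as multiples of SSC. Taking the 20 x SSC composition given in the A2a and A2b screening protocol (3M NaCl, 0.3M sodium citrate), the sketch below works out the salt concentrations of the 2 x and 0.1 x solutions and the volume of 20 x stock needed for a given final volume. The function names and the 500 ml example volume are assumptions for illustration.

```python
# Composition of the 20 x SSC stock in mol/L, as given in the text.
STOCK_FOLD = 20.0
STOCK_20X_SSC = {"NaCl": 3.0, "sodium citrate": 0.3}


def diluted_composition(target_fold):
    """Molar concentrations of each salt in a target_fold x SSC solution."""
    return {salt: conc * target_fold / STOCK_FOLD
            for salt, conc in STOCK_20X_SSC.items()}


def stock_volume_ml(final_volume_ml, target_fold):
    """Volume of 20 x stock needed to prepare final_volume_ml at target_fold x."""
    return final_volume_ml * target_fold / STOCK_FOLD


print(diluted_composition(2.0))      # rinse and low-stringency washes
print(diluted_composition(0.1))      # final wash
print(stock_volume_ml(500.0, 2.0))   # ml of stock for a hypothetical 500 ml of 2 x SSC
```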

A2a and A2b adenosine receptor cDNA isolation

Approximately 1 x 10^6 plaques were plated on
C600 HflA bacterial cells. Plaques were lifted onto
Hybond-N nylon filters (0.45 µm, 137mm, Amersham). DNA
was denatured on the filters with a 3 minute incubation on
0.5M NaOH, 1.5M NaCl and neutralised with a 7 minute
incubation in 0.5M Tris pH 7.2, 1mM EDTA and 1.5M NaCl.
Filters were rinsed in 2 x SSC (20 x SSC is 3M NaCl, 0.3M
sodium citrate) and DNA fixed to the filters with a 5
minute exposure to ultraviolet light (312nm). Filters
were prehybridised in 5 x SSPE (5 x SSPE = 0.5M NaCl, 0.05M
NaH2PO4, 0.0005M EDTA, pH 7.7), 5 x Denhardt's (0.1%
(w/v) bovine serum albumin, 0.1% (w/v) Ficoll, 0.1% (w/v)
polyvinylpyrrolidone), 0.5% sodium dodecyl sulphate (SDS),
0.2mg/ml salmon sperm DNA at 65°C for 17 hours. The
filters were hybridised with a radiolabelled probe
corresponding to the PCR amplified DNA fragment encoding
the 300 bp of 3.1 (labelled with [α-32P]-dCTP using the
random primers DNA labelling system (Bethesda Research
Laboratories)). Following hybridisation of the
radiolabelled probe for 20 hours at 65°C, filters were
washed with 2 x SSPE, 0.1% SDS at room temperature for 10
minutes, then with 1 x SSPE, 0.1% SDS at room temperature
for 10 minutes and exposed to Kodak X-OMAT AR5 film for
seven days at -70°C. Two pure phage isolates which
hybridised to the radiolabelled 3.1 DNA fragment were
obtained. The two DNA inserts were excised from the phage
vector using EcoRI digestion and subcloned into M13mp19
for sequencing. Sequence analysis indicated that one cDNA
insert of approximately 2.6 kilobases encoded the
full-length clone for the 3.2 receptor. The sequence of
the cDNA insert encoding the 3.1 receptor (the human A2a
adenosine receptor) is shown in Figure 2 (together with
the deduced amino acid sequence), whilst the sequence of
the cDNA insert encoding the 3.2 receptor (the human A2b
adenosine receptor) is shown in Figure 3 (together with
the deduced amino acid sequence).
Expression of the cloned A1, A2a and A2b adenosine
receptors in mammalian cells

Each cloned full-length cDNA was subcloned into a
mammalian cell expression vector (pcDNA1neo for A2a and
A2b and pRc/CMV for A1 (Invitrogen)) in such a way as to
direct expression of the encoded receptor portion.

Mammalian cell lines (Chinese Hamster Ovary - CHO K1
or Human Embryonic Kidney - HEK 293) were independently
transfected with the recombinant expression vectors and
cell lines established which had stably integrated the
cloned receptor DNA. The stably transfected cell lines
were examined for their ability to bind a range of
adenosine analogues as shown in Figure 4. Furthermore,
the effect on cyclic AMP (cAMP) levels of receptor
activation by adenosine agonists was examined as shown in
Figure 5.

These studies demonstrate that cDNA clone 3.1 encodes
an adenosine A2a receptor, cDNA clone 3.2 encodes an
adenosine A2b receptor and that the A1 cDNA encodes an
adenosine A1 receptor. The significant amounts of
purified receptor protein that this invention makes it
possible to generate can be used as a tool to facilitate
the design and chemical synthesis of highly specific
agonists and antagonists for each receptor subtype.
Knowledge of the primary sequence differences between the
related receptor subtypes as determined by this invention
provides crucial information for the design of receptor
subtype specific agonists and antagonists.

It will be appreciated by persons skilled in the art
that numerous variations and/or modifications may be made
to the invention as shown in the specific embodiments
without departing from the spirit or scope of the
invention as broadly described. The present embodiments
are, therefore, to be considered in all respects as
illustrative and not restrictive.

CLAIMS

1. A DNA molecule encoding the human A1 adenosine
receptor, the DNA molecule having a sequence substantially
as shown in Figure 1 or a functionally equivalent sequence.

2. A DNA molecule encoding the human A2a receptor
subtype, the DNA molecule having a sequence substantially
as shown in Figure 2 or a functionally equivalent sequence.

3. A DNA molecule encoding the human A2b adenosine
receptor subtype, the DNA molecule having a sequence
substantially as shown in Figure 3 or a functionally
equivalent sequence.

4. A method of producing the human A1 adenosine receptor
comprising culturing a cell transformed with the DNA
molecule as claimed in Claim 1 under conditions which
allow expression of the DNA sequence such that the human
A1 adenosine receptor is expressed on the cell surface and
optionally recovering the human A1 adenosine receptor.

5. A method of producing a human A2a adenosine receptor
comprising culturing a cell transformed with the DNA
molecule as claimed in Claim 2 under conditions which
allow expression of the DNA sequence such that the human
A2a adenosine receptor is expressed on the cell surface,
and optionally recovering the human A2a adenosine receptor.

6. A method of producing a human A2b adenosine receptor
comprising culturing a cell transformed with the DNA
molecule as claimed in Claim 3 under conditions which
allow expression of the DNA sequence such that the human
A2b adenosine receptor is expressed on the cell surface
and optionally recovering the human A2b adenosine receptor.

7. A method of screening a molecule for adenosine
agonist or antagonist activity, comprising contacting the
molecule with the human A1, A2a and A2b adenosine
receptors produced by the method as claimed in any one of
Claims 4 to 6.

Sequence Range: 1 to 1290

10 20 30 40 

• # * * 

CGC AGG ATG GIG CTT GOC TOG TGC COC TTG GTG O0C GXC TGC TGA TGT 

SO 60 70 80 90 

* * * # + 

GOC CAG OCT GIG COC GOC ATG COG CCC TCC ATC TCA OCT TIC CAG GCC 

Met Pro Pro Ser Ile Ser Ala Phe Gln Ala>

100 110 120 130 140

* * * * *

GCC TAC ATC GGC ATC GAG GTG CTC ATC GCC CTG GTC TCT GTG CCC GGG
Ala Tyr Ile Gly Ile Glu Val Leu Ile Ala Leu Val Ser Val Pro Gly>

150 160 170 180 190

* * * * *

AAC GTG CTG GTG ATC TGG GCG GTG AAG GTG AAC CAG GCG CTG CGG GAT
Asn Val Leu Val Ile Trp Ala Val Lys Val Asn Gln Ala Leu Arg Asp>

200 210 220 230 240

GCC ACC TTC TGC TTC ATC GTC TCG CTG GCG GTG GCT GAT GTG GCC GTG
Ala Thr Phe Cys Phe Ile Val Ser Leu Ala Val Ala Asp Val Ala Val>

250 260 270 280 

* * • * 

GGT GCC CTG GTC ATC CCC CTC GCC ATC CTC ATC AAC ATT GGG CCA CAG
Gly Ala Leu Val Ile Pro Leu Ala Ile Leu Ile Asn Ile Gly Pro Gln>

290 300 310 320 330

* * * * *

ACC TAC TTC CAC ACC TGC CTC ATG GTT GCC TGT CCG GTC CTC ATC CTC
Thr Tyr Phe His Thr Cys Leu Met Val Ala Cys Pro Val Leu Ile Leu>

340 350 360 370 380 

* * # * * 

ACC CAG AGC TCC ATC CTG GCC CTG CTG GCA ATT GCT GTG GAC CGC TAC
Thr Gln Ser Ser Ile Leu Ala Leu Leu Ala Ile Ala Val Asp Arg Tyr>

390 400 410 420 430

* * * * *

CTC CGG GTC AAG ATC CCT CTC CGG TAC AAG ATG GTG GTG ACC CCC CGG
Leu Arg Val Lys Ile Pro Leu Arg Tyr Lys Met Val Val Thr Pro Arg>

440 450 460 470 480

* * * * *

AGG GCG GCG GTG GCC ATA GCC GGC TGC TGG ATC CTC TCC TTC GTG GTG
Arg Ala Ala Val Ala Ile Ala Gly Cys Trp Ile Leu Ser Phe Val Val>

FIG. 1

490 500 510 520

* * * * 

Gly Leu Thr Pro Met Phe Gly Trp Asn Asn Leu Ser Ala Val Glu Arg>

530 540 550 560 570

Ala Trp Ala Ala Asn Gly Ser Met Gly Glu Pro Val Ile Lys Cys Gln>

580 590 600 610 620



Pta £5 iJf S5 if S ^ ^ *" 610 ■« TO *« « TTT 
*ne Glu Lys Val lie Ser Met Gin Tyr Met Val Tyr Phe Asn Phe Phe> 



630 640 650 660 670

GTG TGG GTG CTG CCC CCG CTT CTC CTC ATG GTC CTC ATC TAC CTG GAG
Val Trp Val Leu Pro Pro Leu Leu Leu Met Val Leu Ile Tyr Leu Glu>

680 690 700 710 720

* * * * *

GTC TTC TAC CTA ATC CGC AAG CAG CTC AAC AAG AAG GTG TCG GCC TCC
Val Phe Tyr Leu Ile Arg Lys Gln Leu Asn Lys Lys Val Ser Ala Ser>





730 740 750 760

* * *

TCC GGC GAC CCG CAG AAG TAC TAT GGG AAG GAG CTG AAG ATC GCC AAG
Ser Gly Asp Pro Gln Lys Tyr Tyr Gly Lys Glu Leu Lys Ile Ala Lys>


770 780 790 800 810

* * * *

TCG CTG GCC CTC ATC CTC TTC CTC TTT GCC CTC AGC TGG CTG CCT TTG
Ser Leu Ala Leu Ile Leu Phe Leu Phe Ala Leu Ser Trp Leu Pro Leu>


820 830 840 850 860

* * * * *

CAC ATC CTC AAC TGC ATC ACC CTC TTC TGC CCG TCC TGC CAC AAG CCC
His Ile Leu Asn Cys Ile Thr Leu Phe Cys Pro Ser Cys His Lys Pro>


870 880 890 900 910

* * * *

AGC ATC CTT ACC TAC ATT GCC ATC TTC CTC ACG CAC GGC AAC TCG GCC
Ser Ile Leu Thr Tyr Ile Ala Ile Phe Leu Thr His Gly Asn Ser Ala>

FIG. 1 (cont'd.)

920 930 940 950 960

* * * * * 

ATG AAC CCC ATT GTC TAT GCC TTC CGC ATC CAG AAG TTC CGC GTC ACC
Met Asn Pro Ile Val Tyr Ala Phe Arg Ile Gln Lys Phe Arg Val Thr>

970 980 990 1000

* * * *

TTC CTT AAG ATT TGG AAT GAC CAT TTC CGC TGC CAG CCT GCA CCT CCC
Phe Leu Lys Ile Trp Asn Asp His Phe Arg Cys Gln Pro Ala Pro Pro>

1010 1020 1030 1040 1050

* * * * *

ATT GAC GAG GAT CTC CCA GAA GAG AGG CCT GAT GAC TAG ACC CCG CCT
Ile Asp Glu Asp Leu Pro Glu Glu Arg Pro Asp Asp ***>

1060 1070 1080 1090 1100 

* * * * * 

TCC GCT CCC ACC AGC CCA CAT CCA GTG GGG TOP CAG TCC AGT CCT CAC 

1110 1120 1130 1140 1150 

* * * * * 

ATG CCC GCT GTC CCA GGG GTC TCC CTG AGC CTG CCC CAG CTG GGC TGT 

1160 1170 1180 1190 1200 

* * * * * 

TGG CTG GGG GCA TGG GGG AGG CTC TGA AGA GAT ACC CAC AGA GIG TGG 

1210 1220 1230 1240 

* * * * 

TCC CTC CAC TAG GAG TTA ACT ACC CIA CAC CTC TGG GCC CTG CAG GAG 

1250 1260 1270 1280 1290 

***** 

GCC TGG GAG GGA AGG GTC CTA CGG AGG GAC CAG GTG TCT AGA 



FIG. 1 (cont'd.)

Sequence Range: 1 to 2575



10 20 30 40 

# * * * 

CAA ITT TCA GCT GXT CTT TGC TCA ATA ATA ACT TTT TTA TCA CCA AGA 



50 60 70 80 90

* * * * *

TAT CTC TCT AAG TTT TTG ACA TAT TCC TCA TTT GTT TTG ATA AAA CTT

100 110 120 130 140

• * * * * 

TTC TTA TTT TCT TAG AAA AAT AAG TTA CTA AAA GTC ATA TAT CAT TCT 



150 160 170 180 190

• * * * * 

ATA TCT TCA AAA TAT TGC TTA AAA CTA GGA CTT GTA TTT AAA TGT TTT 



200 210 220 230 240



TTC TTC TTA AAG ACA ATT TGC AGG TGC CCT CAG GAA CCC TGA AGC TGG 



250 260 270 280 290



GCT GAG CCA TGA TGC TGC TGC CAG AAC CCC TGC ABA GGG CCT GGT TTC 

300 310 320 330 

* * * * 

AGG AGA CTC AGA GTC CTC TGT GAA AAA GCC CTT GGA GAG CGC CCC AGC 

340 350 360 370 380 

*' * * '* * 

AGG GCT GCA CTT GGC TCC TGT GAG GAA GGG GCT CAG GGG TCT GGG CCC 

390 400 410 420 430 

* * * * * 

CTC CGC CTG GGC CGG GCT GGG AGC CAG GCG GGC GGC TGG GCT GCA GCA 

440 450 460 470 480 

* * * * * 

AAT GGA GCG TGA GCT GGC CCA GCC CGC GTC CCT GCT GAG CCT GCC TGT 



490 500 510 520 530



CCT CTG TGG GC ATG CCC ATC ATG GGC TCC TOG CTG TAC ATC ACG GIG GAG 
Met Pro lie Met Gly Ser Ser Val Tyr He Thr val Glu> 



540 550 560 570



CTG GCC ATT GCT GIG CTG GCC ATC CTG GGC AAT CTG CTG CTG TGC TGG 
Leu Ala lie Ala Val Lea Ala lie Leu Qy Asa Val Leu Val Cys Trp> 

FIG. 2

580 590 600 610 620

* * * * * 

GCC GTC TOG CTC AAC AGC AAC CTG CAG AAC GTC AGC AAC TAC TIT GIG 
Ala Val Trp Leu Asa Ser Asn Leu Gin Asn Val Thr Asn Tyr Phe Val> 

630 640 650 660 670 

* * * * * 

GIG TCA GIG GCG GOG GCC GAC ATC GCA GIG GGT GIG CTC GCC ATC CCC 
Val Ser leu Ala Ala Ala Asp lie Ala Val Gly Val Leu Ala lie Pro> 

680 690 700 710 720 

* * * * * 

TTT GCC AIC AOC ATC AGC ACC GGG TTC TGC GCT GCC TGC CAC GGC TGC 
Phe Ala lie Thr He Ser Thr Gly Phe Cys Ala Ala Cys Bis Gly Cys> 

730 740 750 760 770 

* * * * * 

CTC TTC ATT GCC TGC TTC GTC CTG GTC CTC ACG CAG AGC TCC ATC TTC 
Leu Phe ne Ala Cys Phe Val Leu Vfcl Leu Thr Gin Ser Ser lie Phe> 

780 790 800 810 

* * * * 

ACT CTC CTG GCC ATC GCC ATT GAC CGC TAC ATT GCC ATC CGC ATC COG 
Ser Leu Leu Ala lie Ala ne Asp Arg Tyr lie Ala lie Arg lie Pro> 

820 830 840 850 860 

* * * * * 

CTC CGG TAC AAT GGC TTG GIG ACC GGC ACG AGG GCT AAG GGC ATC ATT 
Leu Arg Tyr Asn Gly Leu Val Thr Gly Thr Arg Ala Lys Gly lie Ile> 

870 880 890 900 910 

* * * - * * 

GCC ATC TGC TGG GIG CTG ICG TTT GCC ATC GGC CTG ACT CCC ATG CIA 
Ala lie Cys Trp Val Leu Ser Phe Ala lie Gly Leu Thr Pro Met I*eu> 

920 930 940 950 960 

* * * * * 
GGT TGG AAC AAC TGC GGT CAG CCA AAG GAG GGC AAG AAC CAC TCC CAG 
Gly Trp Asn Asn Cys Gly Gin Pro Lys Glu Gly Lys Asn Bis Ser Gln> 

970 980 990 1000. 1010 

* * " * * * 
GGC TGC GGG GAG GGC GAA GIG GCC TGI CTC TTT GAG GAT GIG GTC CCC 
Gly Cys Gly Glu Gly Gin Val Ala Cys Leu Phe Glu Asp Val Val Pro> 

1020 1030 1040 1050 

* * * * 

ATG AAC TAC ATG GIG TAC TTC AAC TTC TTT GCC TGT GIG CTG GIG CCC 
Met Asn Tyr Met Val Tyr Phe Asn Phe Phe Ala Cys Val Leu Val Pro 

1060 1070 1080 1090 1100 

***** 

CTG CTG CTC ATG CTG GGT GTC TAT TTG CGG AIC TTC CTG GCG GCG CCA 
Leu Leu Leu Met Leu Gly Val Tyr Leu Arg lie Phe Leu Ala Ala Arg> 

1110 1120 1130 1140 1150 

* * * * * 

CGA CAG CTG AAG CAG ATG GAG AGC CAG CCT CTG CCG GGG GAG CGG GCA 
Arg Gin Leu Lys Gin Met Glu Ser Gin Pro Leu Pro Gly Glu Arg Ala> 

FIG. 2 (cont'd.)

1160 1170 1180 1190 1200

COG TCC ACA CTG CAG AAG GAG GTC CAT GCT GCC AAG TCA CIG GCC MC 
Axg Ser Tta Lea Gin lys Gin Val Bis Ala Aim Lys Ser Len Ala Xle> 

1210 1220 1230 1240 1250 

J * * * # 

1260 1270 1280 1290 

TGC tTC ACT TTC TTC TGC CCC GAC TGC AGC CAC GCC CCT CTC TGG CXC 
C^PheTteFhaFheCyaFroAapCysSerHiaAlaPxoLeoTrp Lea> 

1300 1310 1320 1330 1340 

ATG TAC CTG GCC ATC GTC CTC TCC OC ACC AAT TO GIT GIG AAT CCC 
^^X«AlaneV^J^ScrBlsTteAsaSerViavalAsnPro> 

1350 1360 1370 1380 1390 

TTC ATC TAC GCC TAC CGT ATC CGC GAG TTC CGC OG ACC TTC CGC AAG 
Phe lie Tyr Ala Tyr Arq ne Arq Gin Pfce Axq Gin Thr Phe Arq Lys> 

1400 1410 1420 1430 1440 

• * ■ * * ' * * 
ATC ATT CGC AGC CAC GTC CIG AGG CAG OVA GAA CCT TTC AAG GCA GCT 
lie He Arg Ser Bis Val Lea Arg Gin Gin Gin Pro Fhe Lys Ala Ala> 

1450 1460 1470 1480 1490 

* * * * 

GX ACC AGT GCC CGG GTC TIG GCA GCT CAT GGC AGT GTC GGA GAG CAG 
Gly Thr Ser Ala Arg Val Leu Ala Ala His Gly Ser Val Gly Gin Gln> 

1500 1510 1520 1530 

* * * * 

GTC AGC CTC CGT CTC AAC GGC CAC CCG CCA GAG GIG TGG GCC AAC GGC 
val Ser Leu Arg Len Asa Gly Bis Pro Pro Gin Vial Trp Ala Asa Gly> 

1540 1550 1560 1570 1580 

* * * # * 

AGT GCT CCC CAC OCT GAG CGG AGG CCC AAT GGC TAC GCC CTG GGG CTG 
Ser Ala Pro His Pro Gin Arg ArgPro Asa Gly l*r Ala Len Gly Letf> 

1590 1600 1610 1620 1630 

* * * * * 

GTG AGT GGA GGG AGT GCC CAA GAG TCC CAG GGG AAC ACG GGC CTC CCA 
Val Ser Gly Gly Ser Ala Gin Gin Ser Gin Gly Asa Thr Gly Len Pro> 

1640 1650 1660 1670 1680 

* * * * * 

GAC GIG GAG CTC CTT AGC CAT GAG CTC AAG AGA GIG TGC CCA GAG CCC 
Asp val Gin Len Len Ser Bis Gin Len Lys Arg val Cys Pro Gin Pro> 

1690 1700 1710 1720 1730 

***** 

CCT GGC CIA GAT GAC CCC CIG GCC CAG GAT GGA GCA GGA GTG TCC TGA 
Pro Gly Len Asp Asp Pro Len Ala Gin Asp Gly Ala Gly val Ser ***> 

1740 1750 1760 1770 

* * * * 

TGA TTC ATG GAG TTT GCC CCT TCC 1AA G GGA AGG AGA TCT TIA TCI TTC 
— Phe Met Gin Phe Ala Pro Ser *++> 

FIG. 2 (cont'd.)

1780 1790 1800 1810 1820

* * * * • 

TGG TTC GCT TGA OCX GXC ACG TIG GGA GAA GAG AA CM TGC CAG QfcG 

1830 1840 1850 1860 1870 

• * * * • 

ACC CXG AGG GCA GOC GGT TCC TAC TTT GGA CIG AGA GAA GGG AGC COC 

1880 1890 1900 1910 1920 

* * * * * 

AGG CXG GAG CAG CAT GAG GOC CAG CM GAA GGG CTT GGG TTC TGA GGA 

1930 1940 1950 1960 1970 

* * # • • 

AGC AGA TGT TTC AXG CIG TGA GGC CTT GCA CCA GGT GGG GGC CAC AGC 

1980 1990 2000 2010 

* * * # 

AOC AGC AGC ATC TTT GCT GGG CAG GGC OCA GOC CXC CAC TGC AGA AGC 

2020 2030 2040 2050 ' 2060 

* * # * * 

ATC TGG AAG CAC CAC CTT GIC TCC ACA GAG CAG CIT GGG CAC AGC AGA 

2070 2080 2090 2100 2110 

***** 

CIG GOC TGG COC TGA GAC TGG GGA GIG GCT CCA ACA GOC TCC TGC CAC 

2120 2130 2140 2150 2160 

* * * * * 
OCA CAC AOC ACT CTC OCT AGA CXC TCC TAG GGT TCA GGA GCT GCT GGG 

2170 2180 2190 2200 2210 

* * * * * 
COC AGA GGT GAC ATT TGA CIT TTT TTC CAG GAA AAA TGT AAG TGT GAG 

2220 2230 2240 2250 

* * * • 

GAA AOC CTT TTT ATT TTA TTA CCT TTC ACT CXC TGG CIG CIG GGT CIG 

2260 2270 2280 2290 2300 

* * * * * . 
COG TOG GIC CXG CXG CIA ACC TGG CAC CAG AGC CXC TGC COG GGG AGC 

2310 2320 2330 2340 2350 

* * * * • 

CXC AGG CAG TCC TCT CCT GCT GIC ACA GCT GCC ATC CAC TIC TCA GXC 

2360 2370 2380 2390 2400 

* * * « • 

CCA GGG OCA TOT CXT GGA GXG ACA AAG CIG GGA TCA AGG ACA GGG AGT 

2410 2420 2430 2440 2450 

* * * • * 

TGT AAC AGA GCA GXG OCA GAG CAT GGG COC AGG TCC CAG GGG AGA GGT 

2460 2470 2480 2490 

* • « * 

TGG GGC TGG CAG GCC ACT GGC AXG TGC TGA GXA GCG CAG AGC TAC OCA 

2500 2510 2520 2540 

* * * « • 

GXG AGA GGC CTT GIC TAX CIG CCT TTC CTT CIA AAG GGA AXG TTT TTT 

2550 2560 2570 

TCT GAG AXA AAA TAA AAA OGA GOC ACA G

FIG. 2 (cont'd.)

Sequence Range: 1 to 1687

10 20 30 40 

* * * * 

AGC 00C GAG GCT CAG AAG CGG CAG GOG GAG GCG CGG TOC GGG OGC 

60 70 80 90 

# * * * 

TAT GGC CAT GCC CGG CGG GTC TCA CGC GGC TGC CCC TOG CCC GGC GCG 

100 110 120 130 140

• # • * * 

CCT TOG GTA GGG GGC GCC CGG GGC CCA GCT GGC CCG GCC ATG CTG CTG 

Met Leu Leu> 

150 160 170 180 190 

* * * * * 

GAG ACA CAG GAC GCG CTG TAC GTG GCG CTG GAG CTG GTC ATC GCC GCG 
Glu Thr Gln Asp Ala Leu Tyr Val Ala Leu Glu Leu Val Ile Ala Ala>

200 210 220 230 240 

* * * * * 

CTT TCG GTG GCG GGC AAC GTG CTG GTG TGC GCC GCG GTG GGC ACG GCG 
Leu Ser Val Ala Gly Asn Val Leu Val Cys Ala Ala Val Gly Thr Ala>

250 260 270 280 

* * * *

AAC ACT CTG CAG ACG CCC ACC AAC TAC TTC CTG GTG TCC CTG GCT GCG 
Asn Thr Leu Gln Thr Pro Thr Asn Tyr Phe Leu Val Ser Leu Ala Ala>

290 300 310 320 330 

* * * * *

GCC GAC GTG GCC GTG GGG CTC TTC GCC ATC CCC TTT GCC ATC ACC ATC 
Ala Asp Val Ala Val Gly Leu Phe Ala Ile Pro Phe Ala Ile Thr Ile>

340 350 360 370 380 

* * * * * 

AGC CTG GGC TTC TGC ACT GAC TTC TAC GGC TGC CTC TTC CTC GCC TGC 
Ser Leu Gly Phe Cys Thr Asp Phe Tyr Gly Cys Leu Phe Leu Ala Cys> 

390 400 410 420 430 

* * * * *

TTC GTG CTG GTG CTC ACG CAG AGC TCC ATC TTC AGC CTT CTG GCC GTG 
Phe Val Leu Val Leu Thr Gln Ser Ser Ile Phe Ser Leu Leu Ala Val>

440 450 460 470 480 

* * * * * 

GCA GTC GAC AGA TAC CTG GCC ATC TGT GTC CCG CTC AGG TAT AAA AGT 
Ala Val Asp Arg Tyr Leu Ala Ile Cys Val Pro Leu Arg Tyr Lys Ser>

490 500 510 520 

* * * *

TTG GTC ACG GGG ACC CGA GCA AGA GGG GTC ATT GCT GTC CTC TGG GTC 
Leu Val Thr Gly Thr Arg Ala Arg Gly Val Ile Ala Val Leu Trp Val>

FIG. 3 
530 540 550 560 570 

* * * * *

CTT GCC TTT GGC ATC GGA TTG ACT CCA TTC CTG GGG TGG AAC AGT AAA 
Leu Ala Phe Gly Ile Gly Leu Thr Pro Phe Leu Gly Trp Asn Ser Lys>

580 590 600 610 620 

* * * * * 

GAC AGT GCC ACC AAC AAC TGC ACA GAA CCC TGG GAT GGA ACC ACG AAT 
Asp Ser Ala Thr Asn Asn Cys Thr Glu Pro Trp Asp Gly Thr Thr Asn> 

630 640 650 660 670 

* * * * *

GAA AGC TGC TGC CTT GTG AAG TGT CTC TTT GAG AAT GTG GTC CCC ATG 
Glu Ser Cys Cys Leu Val Lys Cys Leu Phe Glu Asn Val Val Pro Met>

680 690 700 710 720 

* * * * *

AGC TAC ATG GTA TAT TTC AAT TTC TTT GGG TGT GTT CTG CCC CCA CTG 
Ser Tyr Met Val Tyr Phe Asn Phe Phe Gly Cys Val Leu Pro Pro Leu>

730 740 750 760 

* * * *

CTT ATA ATG CTG GTG ATC TAC ATT AAG ATC TTC CTG GTG GCC TGC AGG 
Leu Ile Met Leu Val Ile Tyr Ile Lys Ile Phe Leu Val Ala Cys Arg>


FIG. 3 (cont'd.) 



770 780 790 800 810 

* * * * * 

CAG CTT CAG CGC ACT GAG CTG ATG GAC CAC TCG AGG ACC ACC CTC CAG 
Gln Leu Gln Arg Thr Glu Leu Met Asp His Ser Arg Thr Thr Leu Gln>

820 830 840 850 860 

* * * * * 

CGG GAG ATC CAT GCA GCC AAG TCA CTG GCC ATG ATT GTG GGG ATT TTT 
Arg Glu Ile His Ala Ala Lys Ser Leu Ala Met Ile Val Gly Ile Phe>

870 880 890 900 910 

* * * * *

GCC CTG TGC TGG TTA CCT GTG CAT GCT GTT AAC TGT GTC ACT CTT TTC 
Ala Leu Cys Trp Leu Pro Val His Ala Val Asn Cys Val Thr Leu Phe>

920 930 940 950 960 

* * * * *

CAG CCA GCT CAG GGT AAA AAT AAG CCC AAG TGG GCA ATG AAT ATG GCC 
Gln Pro Ala Gln Gly Lys Asn Lys Pro Lys Trp Ala Met Asn Met Ala>

970 980 990 1000 

ATT CTT CTG TCA CAT GCC AAT TCA GTT GTC AAT CCC ATT GTC TAT GCT 
Ile Leu Leu Ser His Ala Asn Ser Val Val Asn Pro Ile Val Tyr Ala>

1010 1020 1030 1040 1050 

* * * * *

TAC CGG AAC CGA GAC TTC CGC TAC ACT TTT CAC AAA ATT ATC TCC AGG 
Tyr Arg Asn Arg Asp Phe Arg Tyr Thr Phe His Lys Ile Ile Ser Arg>

1060 1070 1080 1090 1100 

* * * * *

TAT CTT CTC TGC CAA GCA GAT GTC AAG AGT GGG AAT GGT CAG GCT GGG 
Tyr Leu Leu Cys Gln Ala Asp Val Lys Ser Gly Asn Gly Gln Ala Gly>

1110 1120 1130 1140 1150 

* * * * * 

GTA CAG CCT GCT CTC GGT GTG GGC CTA TGA TCT AGG CIC TCG CCT CTT 
Val Gln Pro Ala Leu Gly Val Gly Leu ***>

1160 1170 1180 1190 1200 

* * * * *
CCA GGA GAA GAT ACA ART OA CAA GAA ACA AAG AGG ACA CGG CTG GIT 

1210 1220 1230 1240 

* * * * 

HC ATT GIG AAA GAT AGC TAC ACC TCA CAA GGA AAT GGA CTG CCT CTC 

1250 1260 1270 1280 1290 

* * * * *

TIG AGC ACT TCC CTG GAG CSV CCA CGT ATC TAG CTA ATA TGT ATG TGT 

1300 1310 1320 1330 1340 

* * * * *

CAG TAG TAG CAC CAA GGA TIG ACA AAT ATA TTT ATG ATC TAT TCA GCT 

1350 1360 1370 1380 1390 

* * * * *

GCT TIT ACT GIG ICG AIT ATG CCA ACA GCT TGA AIG GAT TCI AAC AGA 

1400 1410 1420 1430 1440 

* * * * *

CTC TIT TGT TTT TAA AAG TCT GCC TIG TTT ATG GIG GAA AAT TAC TGA 

1450 1460 1470 1480 

* * * * 

AAC TAT TTT ACT GIG AAA CAG TGT GAA CIA TTA TAA TGC AAA TAC TTT 

1490 1500 1510 1520 1530 

* * * * *

TTA ACT TAG AGG CAA TGG AAA AAT AAA AGT TGA CTG TAC TAA AAA TGT 

1540 1550 1560 1570 1580 

* * * * *

ATA CTT GTT GCC AGG AAG GIG ACC TCA AAA ATT AAA AGT ATA ATT ATT 

1590 1600 1610 1620 1630 

* * * * * 

CGG COG GGC ATG GIG GCT CAC ACC TGT AAT TCC AGC ACT TIG GGA GGC 

1640 1650 1660 1670 1680 

* * * * * 

CAA GGC AGG CGG ATC ACG AGG TCA GGA GTT CAA AAC CAG CCT GTC CAA 
TAT AGT G 

FIG. 3 (cont'd.) 